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A hierarchy of voids: More ado about nothing 
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ABSTRACT 

We extend earlier work on the problem of estimating the void-volume function - the 
abundance and evolution of large voids which grow gravitationally in an expanding uni- 
verse - in two ways. The first removes an ambiguity about how the void-in-cloud pro- 
cess, which erases small voids, should be incorporated into the excursion set approach. 
The main technical change here is to think of voids within a fully Eulerian, rather than 
purely Lagrangian, framework. The second accounts for correlations between different 
spatial scales in the initial conditions. We provide numerical and analytical arguments 
showing how and why both changes modify the predicted abundances substantially. In 
particular, we show that the predicted importance of the void-in-cloud process depends 
strongly on whether or not one accounts for correlations between scales. With our new 
formulation, the void-in-cloud process dramatically reduces the predicted abundances 
of voids if such correlations are ignored, but only matters for the smallest voids in the 
more realistic case in which the spatial correlations are included. 
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1 INTRODUCTION 

The abundance of clusters and its evolution is a useful probe 
of the primordial fluctuation field, the subsequent expansion 
history of the universe, and the nature of gravity. This is, 
in part, because there is an analytic framework for under- 
standing how cluster formation and evolution depends on 
the background cosmological model (Gunn & Gott 1972; 
Press & Schechter 1974; Peacock & Heavens 1990; Bond et 
al. 1991; Sheth, Mo & Tormen 2001; Martino, Stabenau & 
Sheth 2009). 

If the clusters were identified in a galaxy survey, then 
it is possible to identify underdense regions - voids - in the 
same dataset (e.g. Kauffmann & Fairall 1991; Hoyle & Vo- 
geley 2004; Hoyle et al. 2005; Patiri et al. 2006; Pan et al. 
2011). As for the clusters, there exists an analytic frame- 
work for understanding void formation (Blumenthal et al. 
1992; Dubinski et al. 1993; van de Weygaert & van Kam- 
pen 1993; Sheth & van de Weygaert 2004; Furlanetto & 
Piran 2006; Patiri, Betancort-Rijo & Prada 2006), so the 
comoving number density of voids of radius R, and its evo- 
lution, provides complementary information about cosmol- 
ogy (Kamionkowski, Verde & Jimenez 2009; Lam, Sheth & 
Desjacques 2009; D'Amico et al. 2011) and gravity (Martino 
& Sheth 2009). Void shapes are interesting too (Park & Lee 
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2007; Biswas, Alizadeh & Wandelt 2010; Lavaux & Wandelt 
2010), but they are not the primary interest of this paper. 

Following Press & Schechter (1974), studies of cluster 
and void evolution relate the formation of an object to its 
initial overdensity. A cluster today is a region that is about 
200 times the background density, and it formed from the 
collapse of a sufficiently overdense region in the initial condi- 
tions. However, the overdensity associated with a given posi- 
tion in space depends on scale (in homogeneous cosmologies, 
the likely range of overdensities is smaller on large scales). 
So, to estimate cluster abundances, the problem is to find 
those regions in the initial conditions which are sufficiently 
overdense on a given smoothing scale, but not on a larger 
scale. This is because, if the larger region is sufficiently over- 
dense, then, as it pulls itself together against the expansion 
of the background universe and collapses, it will also squeeze 
the regions within it to smaller and smaller sizes. The frame- 
work for not double- counting the smaller overdense clouds 
that are embedded in larger overdense clouds is known as 
the Excursion Set approach (Epstein 1983; Bond et al. 1991; 
Lacey & Cole 1993; Sheth 1998). 

For voids - regions that today are about 20% the back- 
ground density - the problem is slightly more complicated, 
since one must account not just for the analogous void- 
in-void problem, but also for the fact that underdensities 
which are surrounded by sufficiently overdense shells will be 
crushed as the overdensity collapses around them. This void- 
in-cloud problem was identified by Sheth & van de Weygaert 
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(2004) , who also showed how one might account for both the 
void-in-void and the void-in-cloud problems in the language 
of the excursion set approach. 

However, their formulation suffers from an important 
drawback - they treat the identification of the overdensity 
associated with a cloud as a single scale independent num- 
ber. As they noted, it is easy to see that this is, at best, a 
crude approximation. Suppose that this number is that asso- 
ciated with the formation of a cluster. Then, their approach 
corresponds to eliminating from the list of all possible voids 
all those that are surrounded by an initially larger region 
which is destined to have collapsed and formed a cluster by 
the time the void they surround would have formed (were 
it not surrounded by this overdensity). This leads to the 
question of what to do with sufficiently underdense regions 
which were surrounded by regions which will not have col- 
lapsed completely by the time the void inside them forms, 
but that will nevertheless have squeezed the enclosed void, 
thus altering its size, and possibly even preventing its for- 
mation. 

To illustrate the magnitude of this effect, Sheth & van 
de Weygaert showed how the predicted void abundances 
change if one uses the overdensity associated with 'cloud' 
turnaround instead of collapse (the two differ by approx- 
imately a factor of 1.6 in initial overdensity). While the 
difference for big voids is small - big underdense regions 
are unlikely to be surrounded by even larger overdensities 
- the effect on smaller voids is dramatic. Although it is the 
largest voids which are most easily measured, and so most 
likely to place the most interesting constraints on cosmo- 
logical models, the uncertainty from not knowing precisely 
where the void-in-cloud problem becomes relevant is prob- 
lematic. (One way to view this problem is to note that large 
voids are exponentially rare, so to constrain cosmology re- 
quires large survey volumes. By having an accurate model 
of smaller voids, one potentially allows smaller surveys to 
place interesting constraints.) 

The main goal of the current paper is to present a for- 
mulation of the problem which resolves this drawback of 
the initial formulation. The key is to phrase the criterion 
for being a void in terms of the late time field - a void is 
the largest region in the late time field which is sufficiently 
underdense - and to then determine what this requires of 
the initial field. This means one must be able to relate what 
are often called Eulerian volumes in the late-time field, and 
Lagrangian ones in the initial field. Fortunately, this can be 
done within the Excursion Set approach (Sheth 1998; Lam 
& Sheth 2008). 

However, there turn out to be a number of subtleties 
along the way, which are related to one of the technical 
assumptions associated with the Excursion set approach. 
Strictly speaking, the overdensity associated with a given 
position and scale is correlated with the overdensity on all 
other smoothing scales as well. Therefore, if one plots this 
overdensity as a function of smoothing scale, then this looks 
like a random walk with correlated steps. Following Bond 
et al. (1991), most excursion set analyses, and the Sheth & 
van de Weygaert model for voids in particular, make the ap- 
proximation that the steps are uncorrelated, and they then 
assume that the resulting prediction will be a useful ap- 
proximation to that which one would have obtained if one 
had solved the (more physically relevant) correlated steps 



problem. Accounting for such correlations makes relatively 
minor changes to the cloud-in-cloud (or void-in-void) pre- 
dictions (Peacock & Heavens 1990; Maggiore & Riotto 2010; 
Paranjape et al. 2011). In what follows, we will show that the 
difference between the correlated and uncorrelated solutions 
is much larger for the void-in-cloud problem. 

In Section [2] we show how to cleanly resolve the void-in- 
cloud issue in the Excursion set approach. In Section [3] we 
use a numerical Monte-Carlo method to show that the un- 
correlated steps formulation is quite sensitive to this change 
- our solution to the void-in-cloud problem predicts far fewer 
large voids than do Sheth & van de Weygaert. On the other 
hand, the correlated steps formulation is almost completely 
unaffected by the void-in-cloud problem in the first place, 
and so is not sensitive to the change in our prescription. In 
other words, the void-in-cloud problem is a case in which the 
difference between correlated and uncorrelated steps matters 
greatly. A final section discusses some implications. 



2 A BETTER MODEL OF THE 
VOID-IN-CLOUD PROBLEM 

In this section, we show how a more careful statement of the 
void-in-cloud process leads to a slightly modified formulation 
of the problem in the Excursion set approach. In essence, this 
resolution of the problem combines the analysis in Sheth & 
van de Weygaert (2004) with that in Sheth (1998). 

2.1 Lagrangian vs Eulerian treatments 

In what follows, we will denote the Eulerian radius and 
volume of the void by R and V respectively (so that 
V = 47r7? 3 /3), and refer to Lagrangian length scales sim- 
ply through the associated mass m = p{4-KRl/3), where 
p is the comoving background density and _Rl is the 
Lagrangian radius which evolved into the Eulerian ra- 
dius R. We will also use s(m) to denote the variance 
of the linearly extrapolated density contrast when filtered 
on a Lagrangian scale corresponding to mass m: s(m) = 
(2tt 2 ) " 1 J °° dfc k 2 P(k) W 2 (kR L ) , where W(kR L ) is the filter 
and P(k) the linearly evolved matter power spectrum. 

The condition for being identified as a void of Eulerian 
size R at some time t is that the region of size R must be 
(a) less dense than some critical threshold (typically about 
twenty percent of the background density); (b) denser than 
this critical threshold value on all larger Eulerian scales. 
Sheth & van de Weygaert replaced these Eulerian conditions 
with Lagrangian ones. The Lagrangian region of mass scale 
M must be (ill) less dense than some critical density initially 
(typically, linear theory overdensity of —2.71); (hi,) denser 
than this on all larger mass scales; and (cl) not dense enough 
on these larger Lagrangian scales for this to have influenced 
the evolution of the initial void-candidate region sufficiently 
that it did not form a void at late times. 

Sheth & van de Weygaert argued that these require- 
ments correspond to two different barriers in the Excursion 
set approach. In the plane of (linearly extrapolated) initial 
overdensity versus scale, the first two requirements corre- 
spond to the first crossing of a barrier of constant height 5 V . 
(When extrapolated to the present time using linear theory, 
8 V — —2.71, approximately independent of the background 
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cosmology, and this fixes the void mass as M ~ 0.2pV, see 
below.) At issue is how best to implement the last constraint 
(cl): i.e., how to remove from the list of potential voids iden- 
tified in the initial conditions, those which would not also be 
identified as voids at later times (i.e. in the Eulerian field). 

Sheth & van de Weygaert assumed that this could be 
done simply by introducing a second barrier, B: Of the set 
of walks which first cross 8 V at the Lagrangian scale corre- 
sponding to the void mass M, one must remove those which 
crossed B before (i.e. at some mass m > M) they crossed 8 V . 
They assumed that B = 8 C — constant (and hence parallel to 
5 V ), where S c is the initial overdensity required for collapse 
at some time t, extrapolated using linear theory to time t 
(if S v = -2.71 then S c = 1.6860 

This would correspond to 
excluding regions which surround the void candidate region 
and have collapsed by time t, thus completely squeezing out 
the void. 

In our new approach, which allows us to account for re- 
gions which have only partially squeezed the void (and were 
not excluded by Sheth & van de Weygaert), it turns out to 
be more straightforward to not mix-and-match conditions 
in Eulerian and Lagrangian space. Rather, we will work en- 
tirely with the conditions (a) and (b) stated in Eulerian 
space, and we will draw on the analysis in Sheth (1998) to 
implement these Eulerian conditions in the (essentially La- 
grangian) plane of initial overdensity versus scale. 



2.2 The Eulerian treatment 

To begin, one notes that the spherical evolution model re- 
lates the Eulerian overdensity Anl = m/(pV), where m is 
the mass in a region that has volume V at time t, to the 
linearly extrapolated density contrast 8(t) by 



A NL (t) 



pV 



s c J 



(i) 



(Bernardeau 1994). If V = 4%R 3 /3 is specified, then this 
relation defines a curve Bv(m) which gives the value of the 
linearly extrapolated density contrast in a Lagrangian region 
containing mass m which evolves into the Eulerian volume 
V at time t: 



Bv{m) — S c 



-l/Bc 



(2) 



Notice that Bv{m) — > 5 C at m 2> pV, but that it decreases 
monotonically as m decreases, crossing at m — pV, and 
eventually crossing S v at m sufficiently smaller than pV. In 
addition, note that setting 5(t) = —2.71 and S c = 1.686 
makes Anl ~ 0.2, implying a void mass of M « Q.2pV for 
a void of Eulerian volume V. And finally, thinking of V as 
a parameter, note that decreasing V defines a sequence of 
nested curves whose limit, as V — > 0, is the constant barrier 
8 C : B?-to(m) — > 8 C . 



1 Strictly speaking, the exact values of S c and 8 V depend weakly 
on the cosmological parameters; the values quoted above corre- 
spond to the Einstein-deSitter case. Our focus, however, is on 
conceptual issues rather than numerical accuracy, and our dis- 
cussion does not depend on the exact numerical values of these 
parameters. 
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Figure 1. Excursion set model of voids. Dotted lines show the 
'barriers' associated with Eulerian volumes V4 > V3 > V2 > V\ > 
Vb : barriers for larger volumes fall more steeply. The horizontal 
line at 8 V = —2.71 shows the critical linearly extrapolated initial 
density for voids. Solid lines show two examples of random walks; 
both first-cross 8 V on the same mass scale S. Since neither walk 
crossed 8 C prior to crossing 8 V , Sheth & van de Weygaert would 
have assigned the same void mass and Eulerian volume to both 
walks. However, in our prescription, the (blue) one which falls 
monotonically with S is associated with a larger Eulerian volume 
(between V3 and V2), because its evolution is not modified by 
the void-in-cloud process: we would have assigned the same mass 
and volume to it as they did. The other (red) walk represents an 
overdensity on the Eulerian scale V2 (because V2 is first crossed 
at 8 > 0), but a void on the Eulerian scale just smaller than 
this (because the first crossing of the next shallower barrier will 
be at 8 < 8 V ). The evolution of this void has been modified by 
the collapse of the overdensity surrounding it. We would assign a 
larger mass to the 'wall' which surrounds the void, and a smaller 
mass and volume to the void itself, compared to Sheth &; van de 
Weygaert. Moreover, note that, for this walk, the first crossing of 
8 V is actually not so significant. 



The dotted lines in Figure \T\ show such a nested se- 
quence. Also shown are two candidate random walks (blue 
and red solid lines) which first cross 5 V at the same La- 
grangian mass scale S = s(M), so that By(M) — 5 V . Since 
neither of these walks exceeded 8 C prior to first crossing S v , 
Sheth & van de Weygaert would have assigned both walks 
the same Lagrangian mass and Eulerian volume. 

For us, the two walks are rather different void candi- 
dates. This is because the mass inside Eulerian V at time 
t is given by the value of s(m) at which the associated 
barrier Bv(rn) is first crossed (Sheth 1998). For the (blue) 
walk which decreases monotonically, the monotonicity in La- 
grangian 8 translates directly into a monotonicity in Anl, 
so that conditions (a), (b) as well as (aL), (bL) and (cl) are 
all met. For the void associated with this walk, we would 
assign the same mass and volume as would Sheth &c van 
de Weygaert. In particular, the Eulerian volume would lie 
between V3 and V2. 

However, for the other (red) walk, the non-monotonicity 
of S means that Anl is not monotonic either. More impor- 
tantly, although the predicted mass decreases monotonically 
with Eulerian V, it need not do so smoothly. Rather, on 
scales V where Bv is tangent to the walk, the predicted 
mass must jump downwards as V — > V — AV (i.e., as the 
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barrier is made shallower), because the value of s on which 
Bv-av is first crossed can be substantially larger than that 
on which Bv was first crossed. (In the Figure, this happens 
at about V2O This means that, for the entire portion of the 
walk between these two first crossing values (essentially, the 
value of s at which a barrier Bv is tangent to the walk, 
and the next larger value of s at which it pierces the walk), 
translating the Lagrangian 5 to an Eulerian Anl using equa- 
tion (fj} will not yield the correct answer. This, in essence, 
is why an approach based purely on Lagrangian quantities 
will not work: one must use Eulerian quantities. This sharp 
transition in mass (and hence Eulerian density) at nearly 
constant Eulerian volume has a clean physical interpreta- 
tion in terms of a dense "wall" surrounding the underdense 
void. In the current instance, we would assign the void an 
Eulerian volume that is essentially V2, with mass interior to 
the void given by the value of s at which By 2 intersects the 
walk. And we would interpret the value of s at which By 2 
was tangent to the walk as the mass at the void wall. 

For this particular walk, the two masses can be quite dif- 
ferent indicating that the Eulerian void should be rather well 
delineated by the surrounding Eulerian overdensity. This is 
precisely the type of void that is easiest to identify obser- 
vationally - so it is worth noting that it is for just such 
voids that our algorithm can differ substantially from that 
of Sheth & van de Weygaert. The voids on which we would 
agree are those associated with walks that are similar to 
the monotonically decreasing walk in Figure [T] Since these 
correspond to voids for which there is no obvious defining 
'wall', they are hardest to define observationally. 

To highlight how different our algorithm is, it is worth 
contrasting the role played by S v in the two approaches. In 
the old one, first crossings of S v are fundamental, because 
they give the superset of Lagrangian void candidates from 
which one discards those which first crossed 8 C , on the basis 
that they represent voids that would have been crushed out 
of existence by Eulerian evolution. One might have thought 
that, because it accounts for the squeezing rather than com- 
plete crushing of these regions due to Eulerian evolution, our 
modification mainly serves to reduce the predicted volumes 
of the ones which remain. While this is correct, there is a 
subtlety. 

As Figure [T] shows, if the first crossing of 5 V happens to 
lie in a region where the 5 — Anl mapping of equation fT} 
does not apply, then it is simply not as important as sub- 
sequent crossings of S v . E.g., suppose the spike in the walk 
were higher, so that it crossed above Bv 2 for a while, before 
dropping down to and zig-zagging around 5 V a few times. 
Then the Eulerian region just within V2 would not be a void 
(because the walk crossed By 2 above S v ), but one of the sub- 
sequent zig-zags around S v might actually be the one which 
first crosses an Eulerian By, and so represents a squeezed 
Eulerian void. This one would certainly have a smaller vol- 
ume than that given to the initial first crossing candidate 
by Sheth-van de Weygaert, but clearly, although 8 V plays an 
important role, the first crossing of S v is not necessarily the 
most relevant one. The fact that the first crossing of 5 V is no 
longer so important is one reason why we have been unable 
to derive an analytic expression for the distribution of void 
volumes associated with our new formulation of the void- 
in-cloud problem. We discuss this further in the Appendix. 



It is, of course, straightforward to implement our algorithm 
numerically, and we describe this in the next section. 

But before we do so, we note that our new approach 
helps alleviate one unphysical feature of the old model. 
Namely, in the Sheth-van de Weygaert approach, the volume 
fraction covered by voids is 55 C /(5 C + \S V \)- Since 5 C ~ 1.686 
and \8 V \ ~ 2.71, this 'fraction' is nearly 2. It is easy to see 
that this fraction must be smaller in our new approach, be- 
cause we would assign a smaller Eulerian volume to each of 
the Sheth-van de Weygaert void candidates (in some cases, 
this volume is vanishingly small). We show below that the 
associated void covering fraction is 1.17; i.e., although it is 
still greater than unity, the problem is now 20% rather than 
100%. 

2.3 Correlated vs uncorrelated steps 

We expect our model predictions to depend on whether or 
not the steps in the random walk are correlated. For walks 
with uncorrelated steps, the solution to the two-barrier La- 
grangian void-in-cloud problem S v -S c is quite different from 
that for the single S v barrier void-in-void problem; it has far 
fewer small voids (Sheth & van de Weygaert 2004). We ex- 
pect our purely Eulerian void-in-cloud algorithm to produce 
even smaller voids, so that all three estimates of the void 
distribution should differ substantially from one another. 

However, we expect these three estimates of void abun- 
dances to be rather similar for walks with correlated steps. 
This is because correlated steps generally result in smoother 
walks. Indeed, Paranjape et al. (2011) have recently shown 
that the limiting case of completely correlated steps, in 
which the walk height on one scale S completely specifies 
its height on all other s via 5(S)/VS = 5{s)/yG, actually 
provides a useful way of thinking about the single barrier 
problem. In this limit, walks do not zig-zag at all, which, 
in the present context means that the void-in-cloud prob- 
lem never arises, so the solution to the single barrier case 5 V 
would be the same as that for the purely Lagrangian (Sheth- 
van de Weygaert) formulation of the void-in-cloud problem 
(since no walks will have crossed S c prior to crossing S v ). 
Since our algorithm is basically the same as Sheth-van de 
Weygaert for smooth monotonically decreasing walks, the 
prediction associated with our Eulerian void-in-cloud for- 
mulation would also reduce to the first crossing distribution 
for the single barrier of height 8 V . We expect to see dif- 
ferences between these three cases as we move away from 
the completely correlated limit. But since Paranjape et al. 
have already shown that this limit is essentially exact for the 
most massive objects, we only expect to see differences for 
low mass voids. Walks with uncorrelated steps are far from 
this limit, so in this case we expect to see differences appear 
at larger masses. The next section shows that the predicted 
importance of the void-in-cloud effect does indeed depend on 
whether or not one accounts for correlations between steps. 



3 NUMERICAL (MONTE-CARLO) SOLUTION 

Before we show the numerical Monte-Carlo solution of our 
algorithm, note that although the description above is gen- 
eral, it simplifies considerably for power spectra with P(k) oc 
k n with n = -1.2. In this case, s oc m- (n+3)/3 oc m _3/5 ~ 
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Figure 2. Monte-Carlo solution of various Excursion Set based 
predictions for void abundances. Filled circles show the first cross- 
ing distribution of a single barrier of height S v ; open circles show 
the distribution of the subset of walks which did not first cross 
(5 C ; histograms show the distribution associated with our new al- 
gorithm. The black symbols and cyan histogram are for walks 
with uncorrclatcd steps, while the magenta symbols and his- 
togram (which lie very close to each other) are for walks with 
correlated steps (see text for details). (For clarity, we only show 
error bars for the open circles in this case.) The differences be- 
tween the symbols and the corresponding histograms are much 
more pronounced for walks with uncorrelated steps. Solid curves 
show the corresponding analytic solutions for walks with uncorre- 
lated steps, for the single barrier and two-constant barrier cases. 
Dashed curve shows 2 times the distribution Sfo(S) derived in the 
Appendix (equation I A 131 1 . and provides an excellent description 
of the cyan histogram. Dotted curve shows the expected solution 
for walks with completely correlated steps, which describes our 
results for correlated steps rather well. 



m _1//<Sc , so the barrier shape becomes linear in s, and this 
simplifies the numerical analysis considerably. For this rea- 
son, we have chosen to present results for this case first. We 
show results for a CDM power spectrum at the end of this 
section. 

Our Monte-Carlo algorithm works as follows. For a 
walk with uncorrelated steps (corresponding to a filter that 
is sharp in fc-space), we accumulate independent Gaussian 
random numbers gi with a fixed variance As, ,5( uncorr ) — 

53^=i 9ii an< i record the step at which the barrier 5 V was 
first crossed as well as the step at which 8 C was first crossed. 
The distribution of s at which <5„ was first crossed repre- 
sents the solution to the void-in-void problem; that of the 
subset of walks for which S v was first crossed prior to ever 
crossing 8 C represents the Sheth-van de Weygaert algorithm. 
The black filled and open symbols in Figure [2] show these 
two distributions, respectively: the solid black curves go- 
ing through them show the associated analytic expressions 
(from Bond et al. 1991 and Sheth & van de Weygaert 2004; 



equation IA4fl . The agreement indicates that the numerical 
algorithm works. 

The cyan histogram shows the result of implementing 
our algorithm as follows. For a walk that crossed S v at least 
once, we choose all steps prior to the first crossing. At each 
step j we have a pair (5j,Sj) which together define an Eu- 
lerian volume Vj. (In more detail, the value Sj gives a mass 
rrij, and insertion of Sj in equation fT} yields rrij/pVj.) We 
call the smallest value of Vj associated with the walk so far 
Vmin. If Vmm = 0, we stop - this would only have happened 
if the walk exceeded S c . Since this would mean the void can- 
didate has been crushed out of existence, we eliminate the 
walk from the list of void-walks. If V m m > 0, then asking 
that it be a void sets a mass M m i n ~ 0.2pV m m, which deter- 
mines an Smin- (Typically, this value is larger than that on 
which the walk first crossed below S v .) So we check if the 
walk remains below the barrier _By- min (m) (of equation^ for 
all s(m) < Smin- If it does, we store this value and proceed 
to the next walk. If it does not, then we select the first of 
all steps larger than Smin which are below S v , and repeat 
the algorithm above until a void is identified, or until Smi n 
becomes sufficiently large that the associated void size is 
negligibly small. 

In the large mass (or volume) regime where the S v (no 
void-in-cloud) and 8 V -S C (Lagrangian void-in-cloud) distri- 
butions are similar, our algorithm predicts about a factor of 
2 fewer voids. On smaller scales, where the S v -S c prediction 
is dropping sharply, ours predicts more voids - though it 
is still about a factor of 3 smaller than when the void-in- 
cloud problem has been ignored altogether. This quantifies 
the discussion at the end of the previous section about the 
expected differences between these three ways of estimating 
void abundances. 

It turns out to be interesting to classify the voids iden- 
tified by our new algorithm in terms of the number of times 
we had to loop through the algorithm. This is because, 
in the Appendix, we describe an analytic estimate of the 
fraction of walks fo(S) for which a void is identified after 
only a single pass through the algorithm. This estimate is 
in good agreement with the fraction of such walks in our 
Monte Carlos (not shown). Curiously, multiplying this an- 
alytic estimate (equation IA13|) by a factor of 2 provides an 
excellent description of the full set of void walks. This is 
shown as the dashed line in Figure [2] We have not found 
a simple derivation of why this should have been the case. 
Integrating 2/o(S) numerically over all S gives 0.234 as the 
Lagrangian volume fraction; this is in excellent agreement 
with our Monte-Carlos. (The corresponding Eulerian vol- 
ume fraction is a factor 5 larger, i.e. 1.17, as mentioned pre- 
viously.) 

The magenta symbols and histogram show the corre- 
sponding results for walks with correlated steps. In prac- 
tice, we transformed each walk with uncorrelated steps into 
one with correlations by applying smoothing filters of differ- 
ent scales following Bond et al. (1991): we apply the fil- 
ter W(kRi J ) to the same set of numbers gt as above to 
get ^ corr ' = giW{k i R\ J j). Here Rl is the Lagrangian 
length scale related to mass m by m = (4tv /S)pRl. In 
this case, the correlation depends on the form of the fil- 
ter and on the shape of the initial linear theory power 
spectrum P(k), since one needs to know which values of 
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Figure 3. Monte-Carlo distributions for walks with correlated 
steps with the same format as in Figure[2] except that now Monte- 
Carlos used a ACDM P(k) and Tophat smoothing filters. The 
solid and dotted curves are the same as in Figure [2] 



kj and R^j to associate with the j'-th step. Once a power 
spectrum and filter are specified, this can be done by in- 
verting the relations jAs — (2-k 2 )~ 1 J* 3 dk k 2 P(k) and 
jAs = (27T 2 )- 1 / Q °° dkk 2 P(k)W 2 (kR Ijj ). We used a Gaus- 
sian smoothing filter W(kR) = e~ {kR)2/2 and P(k) oc fc" 1 ' 2 . 
We then subjected each correlated walk to the same analysis 
as for the uncorrelated walks. 

Notice that, in contrast to when the steps were uncorre- 
lated, now the three ways of estimating void abundances all 
give almost the same answer. The sense of the differences 
which are beginning to appear at small masses is easily 
understood: ignoring the void-in-cloud problem altogether 
over-estimates the abundances relative to the Lagrangian 
void-in-cloud treatment. This is only a small effect because 
a correlated walk which crosses 5 V is much less likely to 
have crossed 5 C than an uncorrelated walk. Stated differ- 
ently: most walks which crossed 5 V didn't go into the dis- 
allowed (> 5 C ) region anyway, so removing them makes lit- 
tle difference. In turn, the Lagrangian void-in-cloud analysis 
slightly overestimates the abundances relative to our Eulc- 
rian void-in-cloud algorithm, because it only eliminates the 
voids that got completely crushed, but does not alter the 
sizes of those that got squeezed a little. Therefore, it tends 
to overestimate the sizes of the voids, but this only becomes 
a significant effect for rather small voids. 

By a curious coincidence, the first crossing distribution 
for walks with "completely correlated" steps in the presence 
of a single constant barrier of height 5 V 



sf!; cc, (s) = 



1 |&,| 



(3) 



(Paranjape et al. 2011), shown by the dotted curve, pro- 
vides a rather good description of our predicted distribution. 



Recall that this prediction assumes the walks are perfectly 
smooth, so there is no void-in-cloud problem to begin with. 

First crossing distributions for walks with correlated 
steps are relatively insensitive to shape of the underlying 
power spectrum or the smoothing filter (Bond et al. 1991; 
Paranjape et al. 2011). This is also true for the problem 
studied here: Figure shows the result of using P(k) for a 
flat ACDM model with (a 8 ,fl m ) = (0.8,0.27), and Tophat 
smoothing filters. Note that the void-in-cloud problem only 
becomes noticable for small voids - but that it is more not- 
icable than it was for the Gaussian filtered walks (compare 
filled and open circles here and in previous Figure). In ad- 
dition, the solution is now noticably different from that for 
'completely correlated' steps. The sense of both these trends 
is easily understood from the fact that Gaussian smoothing 
is known to produce smoother walks than Tophat (Paran- 
jape et al. 2011), so the results here are intermediate between 
those for sharp-fc and Gaussian smoothing. 



4 DISCUSSION 

We have presented what we believe to be a better Excursion 
Set treatment of the void-in-cloud problem (Section and 
Figure [TJ. In addition to accounting for the fact that some 
voids can be crushed completely if they are surrounded by 
an overdensity which collapses around them, our Eulerian- 
space based approach also accounts for those voids which 
are squeezed rather than completely crushed. We argued 
that voids which are being squeezed by their surroundings 
may be the easiest to recognize observationally, so our mod- 
ification is potentially an important one. In particular, our 
approach shows explicitly why, in some cases, a purely lo- 
cal Lagrangian-based prediction for the evolution yields the 
wrong answer; it can be thought of as an explicit demon- 
stration of how stochasticity in the mapping between the 
Lagrangian and Eulerian density fields can arise naturally. 

The Excursion Set statement of the problem involves 
random walks. We provided an analytic expression (Ap- 
pendix) for the predicted distribution of void sizes for the 
case in which the walks have uncorrelated steps, and showed 
that it was in good agreement with numerical Monte-Carlo 
solutions of the problem. (This expression suffers from a cu- 
rious 'factor-of-two' problem which we discuss briefly - but 
we leave an exact solution of it to future work.) This analy- 
sis suggests that the void-in-cloud process modifies the pre- 
dicted void size distribution significantly, so that our new 
treatment of it was necessary. 

However, this conclusion depends strongly on whether 
or not we account for the correlations between scales in the 
initial density fluctuation field. In contrast to what happens 
for uncorrelated steps, for correlated steps, we found that the 
change in void abundances due to this effect is negligible for 
voids that are larger than 14, where V* is the characteristic 
Eulerian scale associated with voids: V* = 5M*/p, where 
cr(M„) = |<5„|. For flat ACDM with {tt m ,a 8 ) = (0.27,0.8), 
this scale is V* ~ (1.4/i _1 Mpc) 3 at z ~ 0; larger voids have 
not been squeezed by their surroundings. Since voids iden- 
tified in most galaxy catalogs are typically much larger, it 
may be unnecessary to account for the void-in-cloud process 
when interpreting observations. In this case, the void size 
distribution is quite well approximated by that of a single 
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barrier, for which good analytic approximations are avail- 
able (Paranjape et al. 2011). 

Although our results provide increased understanding 
of void abundances and evolution, a number of issues must 
be addressed before they can be used to provide useful con- 
straints on cosmology. First, the correlated walk problem 
is known to underpredict the abundances of clusters (Bond 
et al. 1991). Paranjape et al. (2011) describe at least three 
possible resolutions, which have to do with the fundamen- 
tal assumptions which the excursion set approach uses to 
relate the first crossing distribution with halo abundances. 
Presumably, this problem, and hence the potential resolu- 
tions, also apply to voids. Second, we must include a model 
for transforming our knowledge of voids in the dark mat- 
ter distribution to underdensities in the galaxy distribution. 
This will require applying the analysis of Furlanetto & Piran 
(2006) to our new formula for void abundances, perhaps ac- 
counting for the fact that the voids have non-trivial internal 
density profiles (Patiri et al. 2006, following Sheth 1998). 
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APPENDIX A: THE FIRST CROSSING 
DISTRIBUTION FOR WALKS WITH 
UNCORRELATED STEPS 

In this appendix we sketch the derivation of the first cross- 
ing distribution fo{S) (for walks with uncorrelated steps) 
discussed in section 3, which counts the fraction of walks 
that survive one pass through our algorithm. The analy- 
sis is tractable when the void-in-cloud barriers are linear, 
which happens for a power spectrum P(k) oc k~ 12 . It is 
then convenient to think of the barrier Bv(jn) as a function 
of s = s(m), parametrized by the value 5* = s(M = 0.2pV) 
at which the barrier crosses the constant barrier S v . We 
use the notation Bs(s) to denote the void-in-cloud barrier, 
and equation @ translates to Bs(s) — S c — (8t/S)s where 
St = S c + \8 V \. 

We are after the fraction of walks which satisfy the fol- 
lowing conditions: 

• They first cross the barrier Bs(s) at s = S' < S, with- 
out having crossed 5 V before S' . 

• They first cross the barrier Bs+as(s) after this barrier 
has passed through 8 V , i.e., at s > S + AS. 

We evaluate the resulting fraction in the limit AS — > 0, and 
interpret it as ASfo(S). 

The first condition above requires us to compute the 
fraction of walks dS' J-b(S') which first cross Bs(s) in 
the interval s G (S' , S' + dS'), without having crossed 
8 V before. The second condition requires the fraction 
dsf B A(s\S',B s (S')) of walks that started at height B S (S') 
on scale S' and then went on to first cross Bs+as(s) in the 
range (s, s + ds). The distribution fo(S) is then given by 

ASf (S) = dS'T B (S') ds f B A(s\S' , B S (S')) . 

J0 JS+AS 

(Al) 

The distribution Tb (s) can be written in a form which allows 
a recursive calculation: we first count the fraction of walks 
/b(s) which first cross Bs at s < S, regardless of whether 
they crossed 5 V , and then subtract those which did cross 8 V 
prior to s. We then have 

F B (s) = f B (s)- /" ds'F v (s')f B {s\s',5 v ), (A2) 
Jo 

where J- V {s') (with s' < S) denotes the distribution of first 
crossing of 5 V without crossing Bs, and the integral in the 
second term is counting walks that reached 8 V at s' for the 
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first time without crossing Bs, and then reached Bs for the 
first time at s. A similar argument, with the roles of Bs and 
Sv interchanged, allows us to write 



Fv(s') = fv(s') — / ds" Tb{s") f v {s'\Bs,s") . 
Jo 



(A3) 



where fv(s') is the distribution of first crossing of 5 V whether 
or not it had first crossed Bs, and f v (s'\Bs, s") is the corre- 
sponding conditional distribution. Notice that, despite the 
compact notation, both the distributions J-b{s) and J- V (s) 
depend on the scale S which parametrizes the barrier Bs(s). 
Repeated substitution of Tb (s) in the expression for T v (s) , 
and vice-versa, gives rise to an alternating series, the suc- 
cessive terms of which describe walks which cross S v after 
more and more zigs and zags. 

As an aside, we note that when Bs(s) = S c for all s, 
independent of S (i.e., the Sheth-van de Weygaert model in 
which 8 C is parallel to S v ), then the integrals in the expres- 
sions above can be done analytically, yielding 

J"„(s) = J r S vdw(s, S v , 5c) = fv(s) - / c +t(s) + f v +2r(s) - . . . 

(A4) 

where c, v and T in the expression above denote 8 C , \5 V \ and 
St = 5 C + \S V \ (Lam et al. 2009; D'Amico et al. 2011). The 
distribution Tb in this case is given simply by interchanging 
the roles of <5 C and S v in T v . In this case, the first few terms 
dominate - walks with many zig-zags from one barrier to 
the other are rare - so truncating the series yields a good 
approximation to the full answer. In particular, when 8t 3> 
\8 V \ then the first term dominates for small S: for the largest 
voids, the void-in-cloud problem is irrelevant. 

However, when Bs{s) is a decreasing function of s, 
which eventually crosses S v , then truncating the series is 
dangerous, because close to the point where the two barri- 
ers cross, zig-zags are no longer large, so many can occur. 
The first crossing distribution then becomes very sensitive 
to the exact relation between the first crossing scale, and 
the scale at which the barriers cross. We are interested in 
the case when Bs(s) — S c — (St/S)s is a linear barrier. 
The expression for J- v above can then be solved exactly. 
Since the full proof involves some rather tedious integrals, 
we only present a sketch here highlighting the main ingredi- 
ents. Equations (|A2Jl and (|A3[> can be combined as 



M*') = /«(*') +Ii+2i ; 



(A5) 



where 



Ms') 



dsf v (s'\s,Bs(s))f B (s) 







T 2 {s') = I dsi f v (s'\si,B s (si)) 
Jo 

x / ds 2 /£f(s 

1 1 S2 i &v )Ms2). (A6) 

Jo 

The conditional distributions simplify as in the two-constant 
barrier case, due to the Markovian nature of the walks. Un- 
like this previous case, however, this time the single barrier 
distributions /„ and fs are fundamentally different from 
each other. Whereas /„ is the same as before for a single con- 
stant barrier, f v (s) = (27rs 3 ) -1/2 |c5„|e~' S " /2s , f B for a single 
linear barrier Bs(s) is now given by the Inverse Gaussian 



(Sheth 1998), 

f B (s) = fiG(s,Bs(s)) 



B s (0) 

(27Ts3)l/2 ( 



-B s (s) 2 /2s 



(A7) 



Since the distribution J- v (s) appears in the integral in T2, the 
basic strategy is to recursively use equation (| A5|) and solve 
for J- V (s). For example, the integral in X\ above reduces to 



= ( 2S c + \S V \) c _sl/2S c - ±(26 c + \6 v \) 2 (l-s/S) 



V2 



(A8) 



In practice we compute the first few terms of the recursive 
series, which clearly reveal a pattern that closely mimicks 
the one in equation (|A4|I above, but with a rescaled argu- 
ment for the single barrier distributions. We are left with 



-3/2 



-K/2S -r- 

-/"SvdW 



s/S 



,S v ,8c 



(A9) 

where J r s v dw(i, S v , S c ) was defined in (|A4p . For the inter- 
ested reader, the integrals appearing in the derivation of 
equation (|A9[) involve repeated use of the identities (for 
A,B > 0) 



1 dy 



V 3/2 (1 - j/) 3/2 
1 dy 1 

y 1/2 (i~y) 3/2 ' 



-AL b 2 — A + B _i M+B i2 

AB 



2y 2(1- H ) 



V27T _i(A+fl) 2 



B 



(A10) 



which can be proved using the relation 3.472(5) of Grad- 
shteyn & Rhyzik (2007). 

Using equation (1A9[) in equation (|A2[) leads to an ex- 
pression for J r s(5' / ), which can then be used in equation ()A1|I 
to obtain an expression for fo(S), after substituting for the 
conditional distribution /ba(s|5' / , Bs(S')) using 

fBA(s\S',B s (S')) = fi G (s-S',Bs+A S (s)-B s (S')), 

(All) 

which follows from the Markovianity of the walks, with f%Q 
defined in equation <|A7[) . Notice that, in the limit AS — > 0, 
the evaluation of Bs+as{s) — Bs{S') at s = 5" becomes 
proportional to AS, and hence the remaining expression for 
fo(S) can be evaluated at AS = 0. Unfortunately, not all 
of the resulting integrals can be performed analytically. It 
turns out to be better to use the series for J^vdw given in 
equation (1) of Sheth & van de Weygaert (2004), 



Tsvdw{t,Sv,5 c ) = ^2 p- sin 



(A12) 



which is equivalent to the one in equation (|A4[) (D'Amico 
et al. 2011). This allows us to bring the expression for fo to 
the following form 



Sfo(S) = 



-vl/2 



A-B 



(A13) 
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where 



A = v c v T e"° /2 ^V^erfc (j^A - v T e^ /2 l(u c , v T ) 



B — j-K sin 



j-K\8 v \ 



(A14) 



2_^ /2l/ * T / j 7T 



j 2 7T 2 Vt 



2v* 



+ V27T V T ) - V T III(j, Vt) 



(A15) 



and we used the notation {v v ,v c ,vt} = { j 5„ | , <5 C , 6t}/v~S 
and defined the integrals 

I(„ ,„ T )= r^expf-^-^ 



x erfc 



(^yr^), (M6) 



II(j, V T ) 



1 dy_ 

o Vv 



exp 



v T y 



A 2 (i-y) 



2 2i/f,i/ 



x erfc 



V2 



(A17) 



III(j, v T ) 



f 

Jo 



d.v 
x 
i 



■ exp 



jV(l-x) 



2k^x 



dy 



o (i-y) 3/2 



(1 — 2;?/) exp 



2 2 



x erfc 



V2 



2 i-y, 
(A18) 



which must be performed numerically. We have checked that 
the result (after keeping ~ 600 terms in the sum over j) 
accurately describes the fraction of walks which survive one 
pass through our numerical algorithm, as it should. 

As noted in the text, we find that 2 times fo{S) provides 
an excellent description for the full Monte Carlo distribution 
obtained by our algorithm. This is shown as the dashed line 
in Figure [2] In fact, if we write this factor of two as follows, 



Sf(S) = 2Sf (S) 



SMS) 

1 - 1/2 



S/o(S)XX'\ (A19) 



then the nth term in the sum above approximates the distri- 
bution associated with the set of walks which required n + 1 
loops through our algorithm. While this is highly suggestive 
of a resummation of "loops" , with the "tree- level" result be- 
ing fo(S), we have not found a simple demonstration of why 
this should be the case. 



